LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

نویسندگان

چکیده

Recent advances in document image analysis (DIA) have been primarily driven by the application of neural networks. Ideally, research outcomes could be easily deployed production and extended for further investigation. However, various factors like loosely organized codebases sophisticated model configurations complicate easy reuse important innovations a wide audience. Though there on-going efforts to improve reusability simplify deep learning (DL) development disciplines natural language processing computer vision, none them are optimized challenges domain DIA. This represents major gap existing toolkit, as DIA is central academic across range social sciences humanities. paper introduces LayoutParser, an open-source library streamlining usage DL applications. The core LayoutParser comes with set simple intuitive interfaces applying customizing models layout detection, character recognition, many other tasks. To promote extensibility, also incorporates community platform sharing both pre-trained full digitization pipelines. We demonstrate that helpful lightweight large-scale pipelines real-word use cases. publicly available at https://layout-parser.github.io.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Learning Based Document Theme Analysis for Composition Generation

This paper puts forward theme analysis problem in order to automatically solve composition writing questions in Chinese college entrance examination. Theme analysis is to distillate the embedded semantic information from the given materials or documents. We proposes a hierarchical neural network framework to address this problem. Two deep learning based models under the proposed framework are p...

متن کامل

Deep learning-based CAD systems for mammography: A review article

Breast cancer is one of the most common types of cancer in women. Screening mammography is a low‑dose X‑ray examination of breasts, which is conducted to detect breast cancer at early stages when the cancerous tumor is too small to be felt as a lump. Screening mammography is conducted for women with no symptoms of breast cancer, for early detection of cancer when the cancer is most treatable an...

متن کامل

A Toolkit for Analysis of Deep Learning Experiments

Learning experiments are complex procedures which generate high volumes of data due to the number of updates which occur during training and the number of trials necessary for hyper-parameter selection. Often during runtime, interim result data is purged as the experiment progresses. This purge makes rolling-back to interim experiments, restarting at a specific point or discovering trends and p...

متن کامل

Preference Learning for Document Image Analysis

We resort to preference learning in order to address the problem of acquiring necessary knowledge in two distinct steps of the document image analysis process: 1) reading order detection, and 2) document summarization. We advocate a relational approach for both cases and we propose a probabilistic relational learning method. Experiments on real data for both applications prove the effectiveness...

متن کامل

Learning Document Image Features With SqueezeNet Convolutional Neural Network

The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-86549-8_9